Modification of CHF and BIC coefficients for Evaluation of Clustering with Mixed Type Variables

نویسندگان

  • Tomas Löster
  • Tomas Pavelka
چکیده

Cluster analysis is a multivariate statistical method, which is used to classify objects. It is used in many areas, such as the classification of customers or respondents in various marketing surveys. Individual objects are characterized by different variables. Variables can be quantitative and qualitative. Depending on the type of variables it is necessary to select the appropriate method of measuring distances of objects and clusters. There are many ways how to measure these distances and it is not clearly defined how to choose specific measure in different conditions. Depending on the extent of distances and the method chosen may arise different clusters, and thus different results. For this reason, it is necessary to evaluate the clustering result. The evaluation should analyze the numbers of clusters and different clustering methods. There are many coefficients for evaluate results of clustering. In the current literature are defined in particular coefficients, which are used for the quantitative variables. For variables of mixed types (a combination of qualitative and quantitative) are coefficients described only in a very limited extent. The aim of this paper is to analyze the modified coefficients CHF and BIC on real data sets in case of mixed types variables.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mixture-model cluster analysis using information theoretical criteria

The estimation of mixture models has been proposed for quite some time as an approach for cluster analysis. Several variants of the Expectation-Maximization algorithm are currently available for this purpose. Estimation of mixture models simultaneously allows the determination of the number of clusters and yields distributional parameters for clustering base variables. There are several informa...

متن کامل

The validity and reliability of the Brief Fear of Negative Evaluation Scale in women with tension-type headaches

Aim and Background: Examining fear of negative evaluation as one of psychological causes in headaches is important. The aim of the present study was to investigate validation and psychometric properties of the Brief Fear of Negative Evaluation Scale (BFNES, Leary, 1983) in a group of women with tension-type headaches. Methods and Materials: A total of 110 women with tension-type headaches in ...

متن کامل

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

Mean Activity Coefficients Measurements and Thermodynamic Modeling of the Ternary Mixed Electrolyte KCl + Lactose + Water System at T = 298.15 K

In this work, the mean activity coefficients of KCl in the KCl+lactose +water system were determined using the potentiometric method. The electromotive force measurements were carried out on the galvanic cell without liquid junction of the type: Ag|AgCl|KCl (m), lactose (wt.%), H2O (1−wt.) %|K-ISE, in various mixed solvent systems containing 0, 5,7.5, 10 and 12.5 % mass fractions of lactose. Th...

متن کامل

Cost Effective Heat Exchanger Network Design with Mixed Materials of Construction

This paper presents a simple methodology for cost estimation of a near optimal heat exchanger network, which comprises mixed materials of construction. Intraditional pinch technology and mathematical programming it is usually assumed that all heat exchangers in a network obey a single cost model. This implies that all heat exchangers  in a network are of the same type and use the same mate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013